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Let  F and  G be  two  distribution  functions.  Under  certain 

circumstances  there  exists  a function  h:^0,l)  -*■  {0,1)  such  that  G = hF  .j 

Steck  and  Zimmer  originally  proposed  the  "model"  G = hF  for  possible  use 

v 

in  accelerated  life  testing.  -'In  that  case  G and  F can  be  taken  to  be, 
respectively,  the  distribution  functions  of  an  ''object"*  under  normal 
conditions  and  under  conditions  causing  accelerated  aging.  Presumably  one 
would  have  a fairly  good  idea  of  F from  testing;  an  estimate  of  h would 
then  provide  an  estimate  of  G.  It  is  also  possible  that  h might  be 
determined  from  theoretical  considerations  in  which  case  an  estimate  of  F 
would  provide  an  estimate  of  G.  Finally,  it  is  possible  that  estimates  of 

h could  provide  insights  into  the  aging  process  itself.  

The  model  G = hF  is  "obviously"  not  restricted  to  use  in  accelerated 
life  testing  but  is  potentially  applicable  wherever  two  distributions  are 
"naturally"  related  to  one  another.  It  could,  for  example,  be  used  to 
investigate  human  response  times  under  different  sets  of  conditions  (e.g., 
different  accelerations),  in  learning  theory,  to  investigate  the  effects  of 
collaboration  on  the  ability  to  perform  tasks  or  on  test  score  distributions, 
etc. 


Suppose  F and  G are  continuous  and  that  they  are  strictly  increasing 

on  the  sets  F ^((0,1))  and  G ^((0,1)),  respectively.  Then  a function  h 

such  that  G - hF  exists  if  and  only  if  G ^((0,1))  CZ  F ^((0,1)),  and  in 

this  case  h(y)  - G[F  ^(y)]  for  every  y in  (0,1). 

An  estimator  suggested  by  Steck  and  Zimmer  for  estimating  h is  obtained 

in  the  following  way.  Let  X , •••,X  ,Y. ,*’*,Y  be  an  independent  collection 

1 id  I n 

that 

of  random  variables  such. the  X. 's  have  distribution  function  F and  the  Y 's 

A i j 

have  distribution  function  G.  Order  the  combined  sample  using  some  "rule"  to 


2 


determine  order  in  case  of  ties.  Consider  the  random  walk  from  (0,0)  to 
(1,1)  which,  after  i+k  steps,  is  at  the  point  (i/m,  k/n)  if  i of  the 
first  i+k  combined  order  statistics  are  X's  and  the  other  k of  them 
are  Y's.  The  j-th  step  of  this  random  walk  goes  to  the  right  an  amount 
1/m  (up  an  amount  1/n)  if  the  j-th  combined  order  statistic  is  an  X 
(a  Y) . The  estimator  of  Zimmer  and  Steck,  h,  is  the  right  continuous 
function  obtained  from  the  "path"  of  this  random  walk  and  extended  so  that 
h(x)  =0  if  x < 0 and  h(x)  =1  if  x > 1. 

If  h exists  and  F is  continuous,  then  in  making  probability  compu- 

A 

tations  about  h it  suffices  to  assume  that  the  X^'s  are  uniformly  distri- 
buted on  (0,1)  and  that  G * h. 

Steck  and  Zimmer  suggested  using  confidence  regions  for  h of  the  form 

S,  * S,  (m,n,e,6)  * {h(x-e)-6  < h(x)  < h(x+e)+6  for  all  x} 
n n — » — 

= {h(x-e)-6  h(x)  h(xfe)+6  for  all  x). 

Unfortunately , the  probability  of  such  a region  depends  on  h and,  even  if 

one  knows  h is  not  easily  computed.  Let  a ■ P(D  < e)  > P{sup|F  (x)-F(x)|<e} 

x 

and  c»G  = P(Dn  j 5)  > P{sup  | Gn(x)-G(x)  |_<6 } be  the  usual  coverage  probabilities 

x 

of  the  standard  Kolmogorov-Smimov  confidence  regions.  Let  P.  denote  a 

h 

probability  computed  under  the  assumption  that  G = hF  is  true.  Steck  and 

Zimmer  showed  that  P,  (S,  ) > a *a.  for  all  h.  They  also  pointed  out  that 

n n — r G 

this  bound  is  too  conservative  to  be  of  any  practical  use.  They  conjectured 
that 

(A)  P.  (S.)  > 1 - (1  - a ) (1  - a„)  for  all  h,  and 

n n — f 

1/2  1/2 

(B)  ifm  e»n  6<1  then  P,  (S,  ) is  minimized  when  h(x)  » x for 

n h 


all  x in  (0,1). 
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We  have  counterexamples  to  both  of  these  conjectures.  The  counterexample  to 

conjecture  (B)  is  also  an  example  illustrating  the  "discreteness  effect" 

whereby  a small  change  in  h can  produce  a fairly  sizeable  change  in  the 

coverage  probability  by  excluding  a large  number  of  sample  paths  in  the  second 

representation  for  S^.  We  have  had  no  success  to  date  in  producing  usable 

lower  bounds  for  P,  (S,  ) . We  know  quite  a bit  more  about  what  won't  work, 
h h 

Other  estimators  (besides  h)  and  other  confidence  regions  (besides  S,  ) 

h 

have  been  proposed  and  investigated,  mainly  in  an  effort  to  "remove  comers" 
or  smoothen  things  out;  these  investigations  have  been  of  a preliminary  nature; 
no  theoretical  results  have  been  obtained  and  only  preliminary  simulation  work 
has  been  done. 

Asymptotic  results  have  been  obtained.  Under  suitable  regularity 
conditions  it  has  been  shown  that 


B. (h ( t ) ) 


P{S^(m,n,e/i^,5/i^n)  } - P{-1  <_  1 — + h' (t) 


B2(t)  r 7~ 

) — jyjT  /[6/m  + h'(t)e/m]  <_  1} 


converges  to  zero  uniformly  as  n,m  -*■  ®.  and  are  independent  Brownian 

bridges;  B^  has  undergone  a time  shift.  No  work  has  been  done  on  rates  of 
convergence  and  no  progress  has  been  made  on  providing  a uniform  bound  on  the 
probabilities  given  in  terms  of  the  two  Brownian  bridges.  Note  that  we  can 
replace  m and  n by  their  ratio. 

Under  suitable  regularity  conditions,  h is  concave  if  and  only  if 
G'(t)/F' (t)  is  non-decreasing.  Thinking  of  G'  and  F'  as  the  fatality 
rates  in  the  G and  F populations,  respectively,  this  says  that  earlier 
(in  time)  the  higher  fatality  rate  is  in  the  F population,  later  the  higher 
fatality  rate  is  in  the  G population,  and  as  time  progresses  the  fatality 


rate  shifts  from  the  F population  to  the  G population.  This  seems  to  be 
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a reasonable  assumption  in  some  situations.  The  estimator  which  goes  through 
the  points  (0,0)  and  (1,1),  which  is  concave,  and  which  is  (in  a sense)  the 
best  least  squares  fit  to  Steck's  estimator  subject  to  the  conditions  just  men- 
tioned, would  seem  to  be  a logical  choice  for  estimating  concave  h's. 
Computing  this  estimator  for  specific  data  is  a quadratic  programming  problem. 
Several  methods  of  computation  were  available  but  none  seemed  suitable  for  a 
simulation  study  in  which  many  estimators  must  be  computed.  A class  of 
estimators  has  been  produced  each  of  which  is  a "cross"  between  Hildreth's 
procedure  and  "optimization"  holding  certain  coordinates  fixed  on  "the 
boundary."  It  has  been  proved  that  each  procedure  in  this  class  produces 
the  desired  estimator. 


